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Abstract 

Two important problems affect the ability of asexual populations to accumulate 
beneficial mutations, and hence to adapt. First, clonal interference causes some 
beneficial mutations to be outcompeted by more-fit mutations which occur in the 
same genetic background. Second, multiple mutations occur in some individuals, 
so even mutations of large effect can be outcompeted unless they occur in a good 
genetic background which contains other beneficial mutations. In this paper, we 
use a Monte Carlo simulation to study how these two factors influence the adapta- 
tion of asexual populations. We find that the results depend qualitatively on the 
shape of the distribution of the effects of possible beneficial mutations. When this 
distribution falls off slower than exponentially, clonal interference alone reasonably 
describes which mutations dominate the adaptation, although it gives a misleading 
picture of the evolutionary dynamics. When the distribution falls off faster than 
exponentially, an analysis based on multiple mutations is more appropriate. Using 
our simulations, we are able to explore the limits of validity of both of these ap- 
proaches, and we explore the complex dynamics in the regimes where neither are 
fully applicable. 



INTRODUCTION 



The accumulation of beneficial mutations drives adaptation and evolutionary innovation. 
Yet despite its central importance, the evolutionary dynamics by which a population ac- 
cumulates such mutations is poorly understood. To better understand adaptation in any 
particular system, we must ask two questions. First, what is the range of beneficial muta- 
tions that are possible given the particular environmental challenge and genetic state of the 
population? Second, given this set of possibilities, what will actually happen probabilisti- 
cally? 

The first of these questions is fundamentally empirical, though Orr and Gillespie have 



argued on general theoretical gro unds that the distribut i on o 

iQsi 



mutations should be exponential ((Gillespie, 



1983 



1991 



fitness effects of beneficial 



Orr , 



2002 



20031 ). A vari- 



ety of recent experiment al studies are roughly consistent with this exponential expec t at ion 



DePristq et al 



2005 



IMHQE and SCHLOTTERER. 



2001 



Kassen and Bataillon 



2006; 



LuNZER et al. 



2005 



ROKYTA et al 



2005 



RozEN et al 



2002 



Sanjuan et al 



20041 ) 



However, beneficial mutations are rare and their fitness effects are difficult to measure pre- 
cisely, so these experimental studies are generally based on relatively few total mutations 
and have correspondingly limited resolution. The tail of the distribution, which refers to 
the rare mutations which confer a very large fitness benefit, is particularly hard to measure. 
Further, the spectrum of beneficial mutations available to a population is likely to vary with 
genetic background, history, and the environment, so it is unclear how far we can general- 
ize from individual experimental studies. Thus it is still unknown whether in general the 
distribution of mutant effects, particularly of large-effect mutations, is exponential. 

Even if we knew the precise distribution of mutational possibilities, it is not clear how 
a population would evolve. Because mutations are random events, there will inevitably be 
some randomness in how a given population adapts. What we would like to understand is the 
statistics of which beneficial mutations are more or less likely to contribute to adaptation, 
and the dynamics by which they do so. That is, given a set of things that are possible, 
what is the probability that any given one of them will actually occur and contribute to the 
adaptation of the population? What is the evolutionary dynamics by which they do so? In 
this paper, we focus on how the distribution of mutations that actually occur and spread 
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through the entire population (i.e. fix), p/(s), depends on the distribution of mutations that 
are possible, p{s), where s is the fitness benefit from a single mutation. We explore these 
features as a function of the population size N and the overall mutation rate U. Besides 
its importance in understanding adaptation, this question is relevant in practical attempts 
to measure the distribution of possible mutations, since p/(s) is much easier to measure 
experimentally than p{s). We also examine some aspects of the dynamics by which the 
mutations that fix do so. 

There are a number of effects that make the distribution of mutations that fix different 
from the distribution of all possible mutations. First, most beneficial mutations that occur 
are lost rapidly by random genetic drift. If a beneficial mutation is particularly lucky, it will 
avoid this stochastic loss and reach a high enough frequency that thereafter its dynamics be- 
come dominated by selection rather then drift. We refer to this process as the establishment 
of the beneficial mutation. Mutations of larger effect are more likely to survive random drift 
— they have a higher establishment probability — so this will tend to bias the distribution 
of mutations that actually fi x towards larger-effect mutations, rela. tive to the distribution of 



mutations that are possible (IHaldaneI . 



1927 



RozEN et al. 



2002). 



Once a mutation has become established, it will fix provided that nothing else interferes. 
However, this fixation takes time, and other beneficial mutations can become established in 
individuals without the original mutation before the original mutation can fix. In an asexual 
population, if one or more of these other mutations has a larger fitness benefit than the origi- 
nal mutation, the original mutation will event ually be ou t -cornpeted and driven to extinction . 



This process i s known as clonal interference ( IGerrishI . 



2001 



Gerrish and Lenski 



WiLKE 



1998; 



20041 ). The same process also operates in a sexual population, where it is referred to 



as the Hill- Robertson effect, but is mitigated because the tw o competing mutations c an po - 
tentially recombine onto the same genome and fix together (IHlLL and Robertson! . Il966l ). 
In this paper, we focus exclusively on asexual populations, where this effect is strongest. 

In a small population with a small to modest mutation rate, the establishment of a bene- 
ficial mutation is an extremely rare event. Thus clonal interference is unlikely to occur, and 
the distribution of mutations that fix is simply the distribution of mutations that establish. 
In a larger population, or one with a higher mutation rate, however, clonal interference can 
be extremely common. Because small-effect mutations are more likely to be interfered with 
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than large-effect mutations, clonal interference biases the distribution of mutations that 
f ix towards those of large effect. This bias has been analyzed in detail both theoretically 



Gerrish and Lenski. 



1998 



DE ViSSER and ROZEN 



20051). 



WilkeI . I2OOJ) and experimentally (IDE VisSER et all Il999 : 



These analyses of clonal interference only consider mutations which occur in the wild- 
type population; they assume that the largest such mutation is the one that fixes. The 
possibility of double mutations in a single organism is neglected. But, in fact, even if 
a more-fit mutation B occurs before an earlier but less-fit mutation A fixes, A may still 
survive, because an individual with mutation A c an get another mutat i on C such that the 



A-C double mutant is more fit than B. Recently, IDesai and FisherI (120071 ) showed that 
whenever clonal interference is important, these multiple mutations are also at least of 
comparable importance — and, in f act, many large ase xual populations will often routinely 
have triple or quadruple mutations (IDesai et a/.l . 120071 ). Because small-effect mutations are 
more common than mutations of larger effect, they are more likely to form double mutants. 
Thus the possibility of multiple mutations biases the distribution of mutations that fix back 
towards those of smaller effect. In short, it will often be the case that getting two small- 
effect mutations is more common than getting a single (rarer) large-effect mutation. This 
effect depends on the shape of the distribution of mutational effects: the rarer large-effect 
mutations are compared to small-effect ones, the stronger the multiple-mutation effect should 
be. The importance of this effect also depends on population size and mutation rate, though 
in a somewhat different way than clonal interference does. 

In addition to affecting the distribution of mutations that fix, multiple mutations also 
have an important impact on the evolutionary dynamics. Different individuals have different 
numbers and strengths of beneficial mutations, so a large population can maintain substan- 
tial variation in fitness. It is only those mutations that occur in the most-fit individuals 
which have the best chance of surviving and contributing to the long-term adaptation of the 
population. Thus the dynamics of adaptation are slowed down, limited by the rate at which 
good mutations occur in good backgrounds. 

Because the distribution of beneficial mutations which fix depends in a subtle way on both 
clonal interference and multiple-mutation effects, it cannot be fully understood without a 
complete model which includes both. No analytical results from such a model yet exist. 
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though IKim and OrrI ( l2005l ) have analyzed a model which includes some aspects of both 
effects. In this paper, we address this question using Monte Carlo simulations of the full 
evolutionary dynamics of large asexual populations, including both clonal interference and 
multiple-mutation effects. We consider several distributions of possible mutational effects 
p(s), and determine the distribution of mutations that fix, p/(s), across a range of popu- 
lation sizes and mutation rates. We find that clonal interference analysis provides a good 
approximation for some aspects of Pf{s) when large-effect mutations are sufficiently com- 
mon relative to small-effect ones. When large-effect mutations are mo re rare, we find that 



Desai and Fisher 



an ap proximation focusing on multiple mutation effects, proposed by 
(120071 ). can be more appropriate. 

We next turn to the evolutionary dynamics by which beneficial mutations fix. Using 
our Monte Carlo approach, we simulate the dynamics of the full model with a distribution 
of fitness effects. We show that multiple-mutation dynamics involving mutations within a 
narrow range of fitness effects describes the evolution. We describe how this range of fitness 
effects depends on p(s) and the other parameters, and how the mutations within this range 
accumulate. 



MODEL AND SIMULATION METHODS 

We consider an asexual population of haploid individuals with an overall mutation 
rate Ub towards beneficial mutations. Our model also applies to asexual diploids, where the 
fitness effects of mutations refer to their effects in the individual in which they occur. Given 
that a beneficial mutation occurs, we assume that its fitness effect is s (i.e. the fitness of 
the organism increases by a factor of e^) with probability 

where the two parameters a and (3 characterize the shape of the distribution and F is the 
Gamma function. This form for p{s) allows us to explore the importance of the shape of 
the tail of the distribution of mutant effects — that is, the relative rareness of large-effect 
mutations compared to small-effect ones. When j3 = 1, the distribution of mutant effects 
is exponential with mean a. When /? > 1, the distribution of mutant effects falls off faster 
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than exponentially (i.e. large-effect mutations are more rare), and the effect of a "typical" 
beneficial mutation is a. When (3 < 1 the distribution of mutation effects falls off more 
slowly than exponentially (large-effect mutations are more common), and again the effect of 
a typical beneficial mutation is cr. Note that this distribution is normalized to 1, so beneficial 
mutations with effect between s and s + ds occur at a rate Ubp{s)ds. The average fitness 
effect of a beneficial mutation, (s), is 

= "f(TM' 

SO while cr is always a typical effect of a beneficial mutation, it is only exactly equal to the 
average effect for j3 = 1. We assume that there is no epistasis, so that an individual with two 
mutations of effect Si and S2 has fitness e*^^*^ (or more generally for n mutations, the fitness 
is riiLi^'*')- We neglect deleterious mutations, as these are not expected to qualitatively 
affect the dynamics in large pop ulations when benef i cial rautations are relatively common, 



2007 



RouziNE et al. 



2OO3I). 



which is the situation we study (IDesai and FisherI . 

We assume a dynamics with discrete generations. In each generation, we first randomly 
select which individuals will survive to the next generation, weighted by each individual's 
fitness. The overall survival probabilities are normalized so that on average half of the 
population will survive to the next generation. Each surviving individual then duplicates to 
create two identical individuals in the next generation. Finally, each of these individuals in 
the next generation has a probability of acquiring a new beneficial mutation. We then 
repeat this algorithm for the subsequent generation. We record all the information about 
the genetic state of the population at each step. All simulations were checked to ensure that 
the results were extracted after a steady-state had been achieved. Note that this algorithm 
does not enforce an exact population size at each step, but rather keeps the average 
population size equal to A^. 

These evolutionary dynamics have the advantage of being fast to simulate, allowing us to 
explore a greater range of parameters in a reasonable amount of computation time. However, 
they are slightly different from standard Wright-Fisher dynamics. It is not clear which of 
these models is the best representation of any particular population, but our model might 
be expected to correspond to a bacterial or yeast population which divides by binary fission. 
The differences between models do lead to small differences in establishment probabilities. 
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and thus are expected to cause minor modifications to the evolutionary dynamics. However, 
we have also simulated the standard Wright-Fisher dynamics for many of the parameters 
we describe in this paper, and the results are all qualitatively similar. 



RESULTS 



The Distribution of Mutations that Fix 



We begin by studying the distribution of mutations that fix, p/(s), as compared to the 
distribution of mutations that occur, p{s). When and Ub are sufficiently small, neither 
clonal interference nor multiple mutations will occur. Thus the distribution of mutations 
that fix equals the distribution of mutations that establish. We expect 

Pf{s) = n{s)p{s), (3) 

valid for small N and Ub, where 7i{s) is the establishment probability. Note that by conven- 
tion we will assume p/(s) is not normalized; this will make various calculations much more 
transparent. In our model, this establishment probability is given by 

1 - 

^^") = 1 _ e-27v. > (4) 

assuming that s is the fitness advantage relative to the background population. When 
Ns <^ 1 we have 7r(s) ~ while for Ns ^ 1 but s ^ 1 we have 7r(s) ~ 2s. This small- 
population limit of Pf{s), which is the distributio n of mutation s that establish, is sometimes 



called the distribution of contending mutations (IRozEN et a/.l . 120021 ). We will denote it by 
Pc{s). 

In larger populations, we expect clonal interference to suppress the fixation of small- 
effect mutations. Thus for small s, we expect p/(s) to be smaller than Pc{s). But clonal 
interference cannot suppress the fixation of the largest-effect mutations, so if it were the only 
important effect we would expect that for large s, P/(s) should equal Pc(s). Provided that 
small-effect mutations are common enough relative to those of lar ger effect, however, raultiple 



small-s mutations can suppress the fixation of large-s mutations. IDesai and FisherI (120071 ) 
suggest that this will occur whenever the distribution of mutation effects falls off faster than 
exponentially (i.e. when /3 > 1). When this is true, we expect that these multiple mutation 



effects will cause to be smaller than Pc{s) even for large s. When /3 < 1, large- 

effect mutations will typically not be out-competed by multiple smaller-effect mutations, so 
Pf{s) = Pc{s) for large s, though multiple mutations will often still be important to the 
dynamics of adaptation. 

In Fig. 1, we show several examples of the distribution of fixed mutations compared 
to the distribution of possible and contending mutations, p{s) and Pc{s) respectively. The 
predictions of clonal interference analysis are also shown (see Discussion). We see that the 
above expectations are met. For small populations (Fig. la), p/(s) = Pc{s) except for 
very small s (these s are so small that clonal interference prevents them from fixing even 
in these small populations). We find similar results for small populations regardless of f3 
(data not shown). For larger populations, the behavior depends on (3. In all cases, small- 
effect mutations are suppressed quite dramatically by clonal interference effects (Fig. Ib-d). 
Note, however, that clonal interference analysis predicts that this suppression of small-effect 
mutations should be even stronger than we observe. Presumably this is because some of these 
small-effect mutations are able to fix in multiple-mutation combinations together with those 
of larger effect. For large /5, the largest-effect mutations are also suppressed, presumably by 
multiple-mutation effects (Fig. lb). 

Which Mutations Contribute To Adaptation 

To determine how beneficial mutations of different effects contribute to the overall adap- 
tation of the population, we need to weight mutations by their fitness effects. That is, a 
single fixed mutation with effect 2s contributes the same amount to the adaptation of the 
population as two mutations with effect s. Thus R{s) = spf{s) is the distribution of the 
relative contributions to the overall adaptation as a function of s. The integral of R{s) from 
Si to S2 is the total contribution of mutations of size between si and S2 to the adaptation. 

Using R{s), we can study which mutations are most important. We expect that mutations 
of very small effect will not contribute substantially to adaptation, because they are strongly 
suppressed by clonal interference and do not contribute much even when they do fix. On 
the other hand, mutations of very large effect will be too rare to contribute substantially, 
and may be impeded by multiple smaller mutations. Thus we expect that typically R{s) 
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will have a peak at some intermediate value of s, with some range of mutations around this 
that contribute substantially to adaptation. This is indeed what we find. We characterize 
R{s) by its mean, which we call s. In practice, this mean is a good estimate of the "peak" of 
R{s). We estimate the width of the range of mutations around the mean which contribute 
substantially to the evolution by the standard deviation of R{s) around s, which we call 
SD{s). 

In Fig. 2, we show how s in our simulated populations depends on and Ub for sev- 
eral different values of (3. We compare these simulated results to the predictions o f clonal 
interference analysis alone and, for (3 > 1, the analysis of IDesai and FisherI ( j2007l ). which 
combines elements of clonal interference and multiple mutations in an ad-hoc way (see Dis- 
cussion) . 

In Fig. 3, we show SD{s)/s as a function of and Ub, again for several values of 
f3, compared to the predictions of clonal interference analysis. Note that SD{s)/s ^ 1 
corresponds to the case where a narrow range of mutations around s dominate the evolution. 
We see that this is typically the case, regardless of whether the distribution of mutational 
effects falls off faster or more slowly than exponentially, except for small N or Ub- 



The Dynamics of Adaptation 

We now turn to the dynamics by which these important mutations accumulate. We 
have seen that multiple mutations are important in determining s for /? > 1, but not so 
important for (3 < 1. Despite this, mutations of effect around s, which dominate the overall 
adaptation of the population, may accumulate via multiple-mutations dynamics even when 
/3 < 1, although mutations of much larger effect fix whenever they establish. 

These multiple-mutations dynamics are easier to understand intuitively in an idealized 
model where all mutations confer the same fitness advantage s. If this were the case, we 
could describe the state of the population as a distribution of the number of such mutations 
each individual has. Some lucky most-fit individuals have more than the average number 
of such mutations, and it is only additional mutations within this small subpopulation that 
will contribute to the long-term evolution of the population; others will eventually go extinct 
because they are handicapped by their relatively poor genetic background. We define the 
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lead, q, to be the number of such mutations the most-fit individual possesses in excess of the 
average individual (more precisely, q—1 is defined to be the difference in number of mutations 
between the most-fit class of established individuals and the mean individual). Several 
studies have analyzed the accumulation of beneficial mutations when multiple mutants are 



important (i.e. q > 1) , and found that these multiple mutations drarnatica. 



evolutionary dynamics (IDesai and FisherI . 120071 : IRouzine et all boosl . boosi ). 



ly affect the 



When mutations have a range of fitness effects, the dynamics are clearly more complex. 
Yet as we have seen, mutations in some narrow range around some s dominate the evolution. 
It is therefore natural to expect that multiple mutations of effect of order s may routinely 
appear, and that their accumulation be described roughly by the single-s model, with s 
chosen to be s, and the mutation rate Ub the overall mutation rate to mutations within 
roughly ±SD{s) of s. We can define the lead q in this more complex situation as the fitness 
of the most-fit individual minus the fitness of the mean individual, divided by s. That is, q 
is the number of extra s-sized mutations that the most-fit individual has compared to the 
average individual (more precisely, we define g — 1 as the fitness of the most-fit established 
class minus the average fitness, divided by s). 

In Fig. 4, we show how this q depends on and Ub for several different values of j3. We 
see that even for /3 < 1, these multiple mutations are important to the dynamics (where 
values of g = 3 to 5 are reached) . Note that the original clonal interference analysis implicitly 
assumes no multiple mutations. Thus it implies that q is between 1 and 2, depending on 
whether an established mutant group is sweeping to fixation. 

As beneficial mutations accumulate, the population adapts. We define the rate of adap- 
tation, V, to be the rate at which the average fitness of the population increases. In Fig. 5, 
we show how this v depends on and Ub, again for several values of (3. We compare these to 



the pr edictions of clonal interference theory alone and to the analysis of 



Desai and Fisher 



( 120071 ) (see Discussion). 



DISCUSSION 

Our Monte Carlo simulation approach allows us to study the evolutionary dynamics of 
adaptation in large asexual populations, where the effects of both clonal interference and 
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multiple mutations interact in a subtle way. Analytic results are difficult to obtain in this 
complex situation, and hence such analysis has been confined to approximations that focus 
primarily on one or the other effect. Using our simulations, we can assess the usefulness and 
generality of these methods. 

In its original form, which neglects the possibility of multiple mutations, clonal interfer- 
ence analysis predicts the dist ribution of mutations that contribute to ada ptation, and the 



dynamics by which they do so (IGerrish and Lenski 



1998 



WiLKE 



2004 ) . In this analysis 



the probability that a beneficial mutation fixes is t he probability that it es t ablish es and then 



Gerrish and LenskiI (119981 ) found that 



fixes before another more-fit mutation establishes, 
given that a mutation of effect s has established, the expected number of more-fit mutations 
establishing before the original mutation fixes is roughly 



In AT 

S Js 



'n-{x)p{x)dx, 



(5) 



assuming that all mutations arise in the wild-type population. Thus the distribution of 
mutations that fix is 

Pf{s) = vr(.)e'^Wp(.). (6) 

In Fig. 1, we compare this prediction to the results of our simulations. From Eq. ([6]) it 
is also straightforward to calculate the expected s and SD{s). Because clonal interference 
becomes more likely as either population size or mutation rate increases, increasing either 
of these parameters is expected to increase s. These predictions are shown in Figs. 2 and 3. 

The average rate at which mutations fix equals the rate at which those destined to fix 
occur. Thus clonal interference analysis predicts that the average fixation rate {k) is 



{k) = NUb / Pfis)ds. 

•J 



This means that the rate of adaptation v is 



(A:)(.), 



(7) 



(8) 



where (s) is the average fitness of mutations that fix. This prediction is shown in Fig. 5. 

For /3 = 0.5 and /3 = 1, we see from Fig. 2 that clonal interference yields reasonable 
estimates of s. This makes sense, as in this regime multiple mutations do not suppress the 
fixation of larger-effect mutations. For f3 = 10, clonal interference systematically overesti- 
mates s, by an amount which increases with the population size and mutation rate. This 
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also makes sense, as in large populations for (3 = 10 multiple mutations in fact suppress the 
fixation of large-effect mutations. 

Although clonal interference accurately predicts s for small (3, we can see from Fig. 4 that 
for all values of j3, q is greater than 2 when and Uf, are large, pointing to the importance 
of multiple mutations in the dynamics. Clonal interference analysis, by contrast, assumes 
that all mutations occur and fix in the wild-type population, which implies q between 1 
and 2. This underlies the calculation of the fixation rate in Eq. ([7]), which assumes that 
mutations in any individual can contribute to adaptation, when in fact for g > 1 it is only 
the mutations in the relatively rare individuals that already have other beneficial mutations 
which contribute. Thus we expect that neglecting the importance of multiple mutations 
should lead clonal interference analysis to overestimate the rate of adaptation v, for all (3. 
This is indeed what we find for (3 = 0.5, but we see from Fig. 5 that clonal interference 
analysis accurately predicts f for /? = 1, and actually underestimates the rate of adaptation 
for P = 10. 

The reason for this discrepancy is apparent from Fig. 3, which shows that for (3 = 1 and 
(3 = 10, clonal interference tends to underestimate SD{s). This problem gets worse as and 
Ub increase. The reason for this underestimate is that clonal interference assumes that the 
largest mutation that occurs before any other fixes goes to fixation by itself. This strongly 
suppresses the fixation of mutations which have a substantially smaller fitness effect, and 
leads to a prediction of a very small SD{s). But in fact, mutations with a variety of smaller 
effects will sometimes happen to occur in individuals that have this larger-effect mutation, 
and these will sweep to fixation together. This broadens the distributions of mutations that 
fix, and hence increases the actual SD{s), as is apparent in Fig. 3 and in the distributions 
shown in Figs, lb and Ic. This means that clonal interference assumes that only mutations 
within a much narrower range of fitness effects contribute to adaptation than is actually 
the case, which should lead to underestimates of the rate of adaptation. In other words, 
although only mutations that happen in very fit individuals can contribute (which slows 
adaptation), many mutations of various effects occur in these fit individuals and can all fix 
together (speeding adaptation). This underestimate of v is more severe for larger (3, because 
larger f3 corresponds to a larger underestimate of SD{s). We see from Fig. 5 that for (3=1, 
this underestimate of v roughly cancels the overestimate of v caused by the assumption that 
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mutations in any individual can contribute to adaptation. For (3 = 0.5, the underestimate 
of SD{s) is less severe, so it only partially cancels the overestimate, and in sum clonal 
interference overestimates the rate of adaptation. For (3 = 10, the reverse is true. 

Together, these results suggest that clonal interference analysis is the right framework for 
estimating s whenever (3 < 1. For (3 = 0.5, it also gets the distribution of mutations that 
fix roughly correct, while for /? = 1 it underestimates SD{s), and hence misunderstands the 
distribution of mutations that actually fix. For /9 = 10, it does not accurately predict either 
s or the shape of the distribution of mutations that fix. Although it gives accurate estimates 
of V for /? = 1, it is apparent that for all (3 clonal interference remains an incomplete picture 
of the dynamics; multiple mutations are also important in a variety of ways. 



An alternative framework, which 
posed by 



bcuses primarily on multiple-mutation effects, was pro- 



Desai and Fisher] ( 120071 ). These and other auth ors studied a mode 



all beneficial rn u tations have the same fitn e ss ad vantage s ( IDesai and Fisher 



RiDGWAY et al. 



where 



1998 



RouziNE et al. 



20081 . 



2007 : 



20031 ). They calculated the rate at which 



these mutations ac c umula te, f(s), as a function of population size and mutation rate. 



Desai and FisherI (120071 ) then argued that in a more general situation where beneficial 



mutations have a range of fitness effects, under many conditions SD{s) should be small 
compared to s, so that mutations within a narrow range of fitness effects dominate the 
evolution. Thus the single-s model describes the full dynamics, provided that one chooses 
that single s to be s, and chooses the beneficial mutation rate to these mutations to be the 
total mutation rate towards all mutations within roughly SD{s) of s. This should be true 
as long as SD{s) is relatively narrow — at most of order s. Our simulations show that this 
i s indeed the case (Fig. 3), as do recent exp erimental studies in S. cerevisiae and E. coli 



( IDesai et al. 



2007; 



Hegreness et al 



2006f ). 



Of course, the value of s and the width of the range of mutations which contribute to 
the evolution depend on p(s), A^, and the overall mutation rate. A full understanding of 
this depends in a subtle w ay on both clonal interference and multiple-mutation effects, but 



Desai and FisherI (120071 ) proposed a simple approximation. They first calculate v{s) for 



each possible value of s, assuming that only mutations of this size are possible. To do this, 
one must specify an app r opria te mutation rate to mutations of this size, which we refer to as 



Desai and Fisher 



(120071 ) made the ad-hoc assumption that this mutation rate should 
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be the total mutation rate to mutations of order s (i.e. within roughly a factor of 2 of s). 
They then calculated v{s). This v{s) expresses the contribution of mutations of effect s to 
the overall evolution, and thus should equal R{s), up to normalization. From this R{s), they 
calculate s. They find 



a 



where i is related to the overall mutation rate by 



(9) 



-In 



U,, 



(10) 



_ar(l + l//3)_ 

This expression for s is only valid for /? > 1; for distributions of mutational effects that fall 
off exponentially or slower the behavior is more complicated and the analysis breaks down. 

Both this approximation and the original clonal interference analysis predict that s should 
increase with A^, because increasing the population size increases the probability of clonal in- 
terference but does not dramatically change the relative importance of large effect mutations 
relative to multiple smaller-effect ones. On the other hand, for large /3, multiple-mutation 
effects should cause a qualitative shift in the relationship between s and Ui,. Clonal interfer- 
ence analysis alone predicts that s increases with Ub, because higher mutation rates make 
clonal interference more common. However, higher mutation rates also increase the impor- 
tance of multiple small-effect mutations relative to large-effect ones. From Eq. we see 
that for f3 > 1 this should mean that s actually decreases with Ub for large Ub- Although the 
effect is small, we do observe this decrease in our simulations (Fig. 2a). For small (3, on the 
other hand, we expect that multiple mutation effects typically do not impede the fixation of 
large-effect mutations. Thus clonal interference alone gives a qualitatively accurate picture 
of how s depends on and Ub, as observed. 

Given s, and assuming that the appropriate mutation rate is that towards mutations 
of this order, the dynamics of adaptation are similar to that of the corresponding s i ngle- 5 
model. Using Eq. (Q and their analysis of the single-s model, IDesai and FisherI (120071 ) 
calculated how the rate of adaptation v and the lead q should depend on A^ and Ub and the 
shape of the distribution of mutational effects. They found 

^^21n|/V.103-l)_ ^^^j 
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and 

WW C,.(^^. (12) 

As with Eq. ([9]), these expressions are only vahd for f3 > 1. 

We compare these theoretical predictions to our simulation results in Figs. 2, 3 and 5. We 
see that for large N and Ub they give qualitatively the correct behavior for s, the lead q, and 
the rate of adaptation v, though they do systematically overestimate q and v (see below). In 
this regime, these results are more accurate than those given by clonal interference analysis. 
Two qualitative features are particularly important: that q can become larger than 2, and 
that s actually decreases as Ub increases when j3 > 1. For small and Ub, corresponding 
to g < 2, the multiple- mutations results are less accurate, a s expected because the above 
results are valid only when q > 2 (IDesai and FisherI . 120071 ). 

Eq. (ITT]) and Eq. (fT2l) rely on the ad-hoc assumption that the appropriate mutation 
rate to the mutations around s that dominate the dynamics is the total mutation rate to 
mutations of order s (i.e. within roughly a factor of 2 of s). As we have found here, SD{s) is 
often much smaller than s, so the range of mutational effects that contributes substantially 
to the evolution is much smaller than assumed. This means that the intuitive picture behind 
the multiple-mutations model of the dynamics is correct: there is indeed a narrow range of 
mutations which contribute to the evolution, and their accumulation is well-described by 
a corresponding single-s model of the dynamics. However, the estimate of the appropriate 
mutation rate is too large, since a narrower range contributes than was assumed. This means 
that the predictions of q and v in Eq. ( ITTl) and Eq. ( fT2l) should be overestimates, as we 
observe. To correct these overestimates, we need to understand what determines SD{s). 
Unfortunately neither clonal interference analysis nor the multiple- mutations approach does 
this well; it remains an important topic for future analytical work. 

While the clonal interference and the multiple-mutations approximations together help 
us to form a more complete understanding of the dynamics, both leave much to be desired. 
Clonal interference processes appear to be the main determinant of s when /5 < 1, as 
demonstrated by Fig. 2. But even for these small (3, we see that q is often larger than 2, 
and hence clonal interference analysis gives the wrong picture for the dynamics. Further, it 
misses the possibility of smaller-effect mutations fixing together with those of effect roughly 
s, and hence drastically underestimates SD{s) even for (3 = 1. On the other hand, while 
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the multiple-mutations analysis provides the right picture of the dynamics for /3 > 1, and 
accurately predicts s, q and v, it provides no way to estimate SD{s), nor any specific 
predictions when (3 < 1. The simulation approach we have taken in this paper sheds 
some light on where and why these two different approaches work, and has highlighted 
the regimes where neither provides a satisfactory picture. A more detailed understanding 
will require analysis of a general model which explicitly incorporates both clonal interference 
and multiple mutations, to produce a theory which has the correct picture of the dynamics 
in these difficult regimes. 

An interesting result of our simulations is that the general shape of the distribution of 
the mutations that contribute to adaptation, R{s), is rather universal. R{s) is always a 
relatively narrow distribution with a clear peak at some s. The distribution of mutations 
that fix, Pf{s), has a similar shape with a sharp peak near s. This means that if we do a 
single experiment at a given population size and mutation rate, the observed p/(s) will not 
provide much information about the underlying p{s). This lack of sensitivity of experimental 
adaptation to the d istrib ution of mutational effects has been noted in a related context by 



Hegreness et al. 



(120061 ). However, our simulations also show that the scaling of various 
aspects of (such as s) with population size and mutation rate does depend strongly 

on p{s). Most important is the shape of the tail of as we have seen, the way in which 
s depends on and Ui, is strongly dependent on (3. Thus careful experiments which are 
carried out at a range of population sizes or mutation rates may make it possible to infer 
important aspects of p{s) from measurements of p/(s) or the rate of adaptation v. 

As our simulations make clear, the actual values of (3 applicable to natural asexual pop- 
ulations are of central importance to the dynamics by which these populations adapt. The 
values of (3 found in natural populations may also tell us something about the evolutionary 
history of these populations. Orr and Gillespie have argued that if an individual is at a 
random high-fitness genotype, the dist r ibution of more-fit genotypes is exponential, so we 



should expect (3 = 1 ( IGillespie , 



1983 



1984 



1991 



Orr 



2002 



20031 ) . However, as a pop- 



ulation adapts it is natural to expect p{s) to change. For example, a population may face 
a static challenge, and gradually deplete the available beneficial mutations as it adapts. If 
the population were small enough that p/(s) = 2sp(s), the distribution p(s) should converge 
to an exponential as this adaptation progresses. But if clonal interference and multiple- 
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mutation effects cause large-effect mutations to be depleted much faster than small-effect 
ones, we expect that (5 should increase as the available mutations become depleted. Thus 
large observed values of P may be indicative of this type of adaptation. Of course, this 
increase in j3 could be avoided if the accumulation of beneficial mutations tends to open up 
new possibilities for further adaptation. In other words, the structure of fitness landscapes 
has an important role in determining typical values of and how these change as popu- 
lations adapt. Given these values of /3, our simulations provide a way to understand, in a 
statistical sense, which mutations will tend to contribute to adaptation, and the dynamics 
by which they will do so. 
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FIG. 1 Examples of the distribution of possible mutations p{s) (solid black line), the distribution 
of contending mutations pds) (solid red line), and the distribution of fixed mutations Pf{s) from 
our simulations (yellow histogram). Also shown is the distribution of mutations which occurred in 
the simulations (transparent histogram) . Additionally the distribution of predicted fixed mutations 
Pf{s) from a clonal interference calculation are shown (solid blue line). Note the logarithmic scale. 
In all examples, a = 0.02. (a) A small population, where Pf{s) = pds) except for the smallest- 
effect mutations. Here = 3 x 10^, Ub = 10~^, and (3 = 1.0. (b) A large population with 
(3 = 10. Here N = 10^ and Ui, = 10"^. Note that small-effect mutations are suppressed by clonal 
interference effects, while large-effect mutations are suppressed by multiple mutation effects, (c) A 
large population with /3 = 1. Here N = 10^ and Ub = 10~^. Note that small-effect mutations are 
suppressed by clonal interference effects, but less strongly than clonal interference analysis alone 
predicts, (d) A large population with /3 = 0.5. Here A = 1 x 10^ and Ub = 10^^. 
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FIG. 2 The average scaled fitness effect of mutations that fix, s. In all cases a = 0.02. (a) 
Simulation results for s for = lO'^ as a function of Uj, for /? = 0.5 (closed black circles), /3 = 1 
(open red circles), and /? = 10 (open blue squares). Predictions of clonal interference analysis are 
shown as dotted lines, and the predictions of the multiple mutation analysis (for /3 = 10) are shown 
as a solid line. Note that for /? = 10, s decreases with for large C/5. (b) Simulation results for 
s for Uh = 10~^ as a function of iV for /? = 0.5 (closed black circles), f3 = 1 (open red circles), and 
P = 10 (open blue squares). Predictions of clonal interference analysis are shown as dotted lines, 
and the predictions of the multiple mutation analysis (for /3 = 10) are shown as a solid line. 
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FIG. 3 The scaled width of the weighted distribution of mutations that fix, SD(s)/s. In all cases 
a = 0.02. (a) Simulation results for SD{s)/s for = lO'^ as a function of Uf, for (3 = 0.5, /? = 1, 
and P = 10. Predictions of clonal interference analysis are shown as dotted lines, (b) Simulation 
results for s for Uf, = 10~^ as a function of for /? = 0.5, /3 = 1, and P = 10. Predictions of clonal 
interference analysis are shown as dotted lines. 
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FIG. 4 The effective lead q. (a) Simulation results for g as a function of Ui, for /3 = 0.5, /3 = 1, and 
P = 10. Other parameters are as in Fig. 3a. Predictions of the multiple-mutations analysis (for 
[3 = 10) are shown as a solid blue line. Note that clonal interference predicts g ~ 1, independent 
of the parameters, (b) Simulation results for q as a. function of for (3 = 0.5, /3 = 1, and (3 = 10. 
Other parameters are as in Fig. 3b. Predictions of the multiple-mutations analysis (for [3 = 10) 
are shown as a solid blue line. As before, clonal interference predicts g ~ 1, independent of the 
parameters. 
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FIG. 5 The average rate of adaptation, v. (a) Simulation results for u as a function of Uj, for 
P = 0.5, /3 = 1, and /3 = 10. Other parameters are as in Fig. 3a. Predictions of clonal interference 
analysis are shown as dotted lines, and the predictions of the multiple-mutations analysis (for 
P = 10) are shown as a solid blue line, (b) Simulation results for v as a function of N for /? = 0.5, 
/? = 1, and P = 10. Other parameters are as in Fig. 3b. Predictions of clonal interference analysis 
are shown as dotted lines, and the predictions of the multiple-mutations analysis (for (3 = 10) are 
shown solid blue line. 
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